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Abstract 



| The key step of generating the well-known Hsiao code is to construct a {0, l}-check-matrix in 

which each column contains the same odd- number of l's and each row contains the same number 
of l's or differs at most by one for the number of l's. We also require that no two columns 
are identical in the matrix. The author solved this problem in 1986 by introducing a type of 
recursively balanced matrices. However, since the paper was published in Chinese, the solution 
for such an important problem was not known by international researchers in coding theory. In 
this note, we focus on how to practically generate the check matrix of Hsiao codes. We have 
modified the original algorithm to be more efficient and effective. We have also corrected an 
error in algorithm analysis presented in the earlier paper. The result shows that the algorithm 
attained optimum in average cases if a divide-and-conquer technique must be involved in the 
OO ', algorithm. 



^ ■ 1 Introduction 

_ _ i 

Error-detections and corrections are required for computer main memory and secondary storages. In 
recent years, trusted computing and computing reliability have become more and more important in 
theory and practice. Data recovery is at the center of concerns. Error-corrections codes, especially 
Hamming codes and Hsiao codes are still essential to this type of technology [10] [9] . 

The well-known Hsiao code is the most energy saving Hamming Codes [7] [9] [4] [5] [6] . It has 
been widely used in memory fault tolerance for more than thirty decades. As a type of SEC- 
DED codes, i.e. single error correction and double-error detection codes, Hsiao codes attained the 
optimal in minimum odd-weight of columns. Even though there was an observation indicating that 
cellular automata-based codes might be better than Hsiao codes in terms of check bits for cellular 
automata -a future computing device [4] [2] . However, Hsiao code is still the most efficient code 
used in industry [9] [6] . 
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After a code is designed, the most important task is to find its check matrix. The author 
introduced an algorithm that recursively constructed the check-matrix to Hsiao codes in 1986 [1]. 
Unfortunately, the published paper was written in Chinese and the title of the paper did not 
mention the SEC-DED code. This algorithm was not known by international researchers in coding 
theory. 

In this paper, we first explain the algorithm described in [1] where it was addressed in an 
abstract and in a brief manner. Then, we modified some steps to show a more detailed algorithm. 
Finally, we corrected an error in the algorithm analysis of [1] and provided a more precise time 
analysis for general cases. Theoretically, both original algorithm and the modified algorithm are 
very efficient and fast in terms of complexity theory. We have proved that the modified algorithm 
is optimal in average cases. However, in practice, we may find an even more efficient algorithm. 



2 Major Steps of Generating Check Matrix H for Hsiao Code 

The definition of Hsiao code is a type of SEC-DED codes whose check matrix H defined on GF{2) 
satisfies: 

(1) Every column contains an odd number of l's. 

(2) The total number of l's reaches the minimum. 

(3) The difference of the number of l's in any two rows is not greater than 1. 

(4) No two columns are the same. 

Therefore, Hsiao called this code an optimal minimum odd-weight-column SEC-DED code. 

Now, we will discuss how to generate H. Assume that we want to generate the SEC-DED Code 
with k information (data) bits, first according to the general requirement of Hsiao codes [7], 



From this, we can determine R. After R is determined, we define A(R, J,m) a {0, l}-type R x m 
matrix with column weight J, < J < R. No two columns are the same in A(R, J, m). 



R>1 + log 2 (k + R) 



(1) 



A{R, 2 • i + 1, C 2 R i+1 ), i = 0,l, I -I. 



(2) 
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is the combinatorial number. Assume, 
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^C 2 R i+1 <(k + R)<^ i=0 C 2 R i+1 . (3) 
Let m = (k + R) - S^ 1 C^ +1 . We can see that the most important task is to generate 

A(R, 2 ■ I + 1, m) 

because the other cases are the special cases of it. 
Thus, 

H= [@lZoA(R,2 -t + l,C^ i+1 )]e A{R,2-I+l,m) , (4) 
where © is to union these arrays together, horizontally. 

3 The Algorithm of Generating A(i?, J, to) 

According to Section 2, it is not difficult to see that obtaining the matrix A(R, J,m) is the key to 
the check matrix H, especially when m / C R ^ +1 . 

Let's first define some notations: 

(a) A(R, J, m): a {0, l}-type R x m matrix with column weight J, < J < R. 

(b) < t > (m): a {0, l}-type 1 x m matrix in which every component is t. For example, < 1 > 
(2) = (1,1). 

(c) o: the matrix up-down union operator, i.e. N o M = ^ 

(d) ©: the matrix left-right union operator, i.e. N (B M = [NM] 

(e) ©: NWM is to place the matrix M upside down, do "ffi," and then move the rows that 
contain more l's to the top of the matrix. 

(f) L — condition: A(R, J, m) is said to satisfy the condition ifO<J<i?&0<m< Cj£. 

(g) Matrices with equal- weight-columns/rows: Matrices have equal- weight columns and 
quasi-equal-weight rows. In other words, in these matrices, each column has the same number 
of l's and each row has almost the same number of l's. (The difference of the numbers of l's 
is less than 1.) We will also call such a matrix a (weighted) Balanced matrix. 

We also define the following special A(R, J, m) that will keep the rows with more l's at the top 
of the array: 
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(1) lfm = 0, A(R,J,m) = 0. 

(2) If J = 0, A(R, J, m) = (0, 0) T . 

(3) If J = R, A(R, J, m) = (1, 1) T . 

(4) If m = 1, A(R,J,m) = (1, 1,0, ...,0) T , where the number of l's is J. 

(I ... \ 



(5) If J = 1, A(R,J,m) = 








1 




, there are m l's in the matrix. 



V o ... o / 

/ 1 1 ... 



(6) If J = 12-1, A(R,J,m) 



1 ... 1 1 

1 ... 1 

1 ... 1 

V 1 1 ... J 



, there are m l's in the matrix. 



The above matrices are called the ending-states. So we can assume that 2 < J < R — 2 for later 
discussion. 

Let A(R, J,m) satisfy the L-condition. We can represent A(R, J,m) in the following form. 



A(R, J, m) 



1 1 

Ai(R- 1, J- l,mi) A 2 {R- 1, J,m-mi) 



(5) 



Theorem 1 [1] Assume A(R, J,m) satisfy the L-condition. Let mi — ' 
Ai(l? — 1, J — 1, mi) and A2(l? — 1, J, m — mi) also satisfy L-condition. 



], then 



Theorem 2 [1] Suppose that A±(R — 1, J — 1, mi) and A 2 (R — 1, J, m — mi) are Matrices with 
equal- weight-columns/rows, where mi = [*^r- + ^^]- Then, 

A'(R,J,m) = [< 1 > (mi)0 < > (m - mi)] o [Ai(12- 1, J — l,mi)©A 2 (12- 1, J,m-mi)] (6) 

is a matrix with equal- weight-columns/rows. 

The above two theorems indicate a recursive process of generating a matrix with equal-weight 
columns and quasi-equal- weight rows: The Theorem 2 provides the union process that guarantees 
that each row after the union has about the same weight. ©, to place the matrix on the right side 
of the operator upside down, is to avoid one row containing more than one l's after the merge. It 
is the simplest mathematical way of solving the problem. The recursive matrice in (5) is also called 
a recursively balanced matrix. 
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However, by "placing a matrix upside down" and then moving rows with more l's to the top 
of the matrix after merging takes more computational time. In [1], the author made a mistake by 
simply stating that this process is optimum 0(R ■ m). This is because some of the parts in the 
array may need to be "placed upside down" many times (say min{i?, m}). So , in the worst case, 
the complexity of the procedure given in Theorem 2 is 0(R ■ m ■ mm{R, m}). This is still a very 
fast algorithm for generating such a matrix. Other generating algorithms for these check matrices 
were discussed in [11] [5] [6]. However, these algorithms are not in polynomial time. 

It is obvious that if m ■ J is divisible by R, the procedure of "placing a matrix upside down" 
can be ignored. So, 

Corollary 1 Let m- J be divisible by R. Suppose that Ai(R— 1, J— 1, m\) and IS.2{R— 1, J, ra- 
mi) are matrices with equal- weight-columns and equal- weight-rows, where mi = [ m jf + ^^\- Then, 



A'(R,J,m) = [< 1 > (mi)® < > (m - mi)] o [A^R- 1, J- I, mi) © A 2 (R- 1, J,m - mi)]. (7) 

is a matrix with equal- weight-columns and equal- weight-rows. 

Corollary 2 There is 0(m ■ R) algorithm for generating A(R, J,m = Cj0 

Based on the above discussion, we can obtain two algorithms below: 

Algorithm A: the none-recursive algorithm 

Step 1 Check if A(R, J, m) satisfies the L-condition. 
Step 2 Decompose A(R,J,m) into A(Ri, Ji, mi), 
A(i?2, J 2 ,m 2 ), A(R n ,J n ,m n ), 

where each A(i?j, Jj, mj),(i=l,2,...,n) is a ending-state. 
Step 3 Merge A(R i: Ji,mi),(i = 1,2, ...,n) 
using © operator. 

Note: "decomposition" and "merge" use the same objects. 

Algorithm A': the recursive algorithm 

Step 1 Check if A(R, J, m) satisfies L-condition. 

Step 2 Decompose A(R,J,m) into Ai(R— 1, J — l,mi) 

and A(R — 1, J, m — mi). 
Step 3 Merge Ai(R — 1, J — 1, mi) and A(R — 1, J, m — mi) based on (6). 

4 A Better Algorithm for Generating A(R,J,m) 

Even though Theorem 2 did not give an 0(R ■ m) algorithm, it provides a great hint to design a 
(virtually) optimum algorithm. The key is to keep more l's rows at the top of the matrix and to 
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avoid putting more as it needed. 

The following simple calculations can be used to improve the process. We would be able to 
know how many rows have the extra l's in each of Ai (R — 1, J — 1, mi) and A 2 (R — 1, J, m — mi) 
before the merge. After the merge, the array will not change. 

Suppose that Ai(R — 1, J — l,mi) and A 2 (R — 1, J, m — mi) are Matrices with equal-weight- 
columns/rows, where rti\ = Y^jf + ^j^]- 

First, calculate 

n = (J - 1) ■ (mi) (mod R- 1),0 < n < (R- 1), (8) 
r 2 = J-(m-m 1 ) (mod i? - 1), < r 2 < (i? - 1). (9) 

If (ri + r 2 > (R- 1)), let r' = n +r 2 - (-R- 1). Because n < (R- 1) and r 2 < (R- 1), n > r' 
and r 2 > r'. Shift r 2 — r' rows in A 2 to the bottom to obtain a A' 2 . If (r± + r 2 < (R — 1)), then 
move the first r 2 rows to n + 1 to r\ + r 2 , so 

A'(R,J,m) = [< 1 > (mi)© < > (m - mi)] o [A^R- 1, J - 1, mi) © A' 2 (R- 1, J, m - mi)] (10) 

is a matrix with equal- weight-columns/rows. 

The time complexity of the above process is 

T(m) = T(m 1 ) + T(m-m 1 ) + 0(m- R). (11) 

Using the same technique to analyze the average case of Quick-Sort [3], we have 

T{m) = R-m\og{m ■ R) = T{m) =R-m\og{m). (12) 

For the detail analysis, we could view T(m) as the average time required by the randomized 
mi. That means mi could be any number between 1 to m — 1. So 

T(m) = (T(l) + r(m-l) + T(2) + T(m-2) + ... + T(m-l) + r(l) + 0(i2-m-m))/(m-l), (13) 

T ( m ) = ^T^k=iT(k) + 0(R ■ m). (14) 

Using the method of the mathematical induction, we can prove that T(m) < R - m ■ \og(R ■ m) [3]. 

We know that R < m is held in most of cases. Therefore, we have: 

Theorem 3 The time complexity of generating A(R, J, m) is 0(i?-m(log i?+log m)) for average 
cases, or 0(R- m ■ logm) for most cases. 
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5 More Efficient Algorithms for Generating A(R,J,m)? 



To avoid using extra time to build A(R, J,m), we have to know exactly where to put the extra 
l's if r = J ■ m (mod R) is not zero. Except in the case of m = C J R , even though r = J • m 
(mod R), we may have an unbalanced number of l's at the next level. For example, let R = 6, 
J = 3, and m = 10. ni\ = [(J • m)/R + (R — 1)/R] = 6 and (m — mi) = 6. r = ( J — 1) • mi) 
(mod (i? — 1)) = 12 (mod 5) = 2. Therefore, it seems impossible to design an 0{R ■ m) algorithm 
for generating A(R, J, m). In other words, it is reasonable to say that the best algorithm should use 
the divide-and-conquer to reach the time complexity O(R-mlogm). Moreover, to set up a dividing 
point in the middle of the length of the array might break the L-condition, it seems hard to design 
an algorithm that makes a matrix based on two equal sized arrays without the L-condition. We 
have provided an analysis of average cases in Section 4. We have the reason to believe that the 
algorithms discussed in this note attain the optimal in theory. However, practically one can still 
develop more efficient algorithms for generating such a matrix. 



6 Conclusion 

This note translates the major steps of the author's earlier paper written in Chinese for optimal 
Hsiao codes. An algorithm analysis error was found and corrected. The original algorithm has been 
modified and improved. A brief analysis shows that the algorithm reached optimum in average cases 
if a divide-and-conquer technique was involved in the algorithm. 
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Appendix 



Zbl 0629.65046 
Chen, Li 

An optimal generating algorithm for a matrix of equal-weight columns and quasi- 
equal-weight rows. (Chinese. English summary) [J] J. Nanjing Inst. Technol. 16, 
No.2, 33-39 (1986). [ISSN 0254-4180] 

According to a recursive matrix which is introduced in this paper, an algorithm for a 
matrix of equal-weight columns which are inequal each other and quasi-equal-weight 
rows is given. The algorithm is optimal in time and space, and it can be applied 
to generate check matrices of optimal minimum odd-weight-column SEC-DED codes. 
Finally, this algorithm is extended to a set of n elements. 

MSC 2000: *65F30 Other matrix algorithms 

Keywords: recursive matrix; equal-weight columns; quasi-equal-weight rows; optimal 
minimum odd-weight-column SEC-DED codes 
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